Bayesian Approaches in Speech Recognition

Author

  • Shinji Watanabe
Abstract

This paper focuses on applications of Bayesian approaches to speech recognition. Bayesian approaches have been widely studied in statistics and machine learning, and one of their advantages is better generalization than maximum likelihood approaches. Their effectiveness for speech recognition has been shown experimentally in speaker adaptation tasks using Maximum A Posteriori (MAP) estimation and in model complexity control using the Bayesian Information Criterion (BIC). In addition to MAP, BIC, and other Bayesian techniques, this paper introduces variational Bayesian approaches for speech recognition. VBEC (Variational Bayesian Estimation and Clustering for speech recognition) is a fully Bayesian speech recognition framework that achieves robust acoustic modeling and speech classification. This paper explains the formulation and experimental effectiveness of these Bayesian approaches for speech recognition.

Similar resources

Improved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition

Context-dependent modeling is a widely used technique for better phone modeling in continuous speech recognition. While different types of context-dependent models have been used, triphones have been known as the most effective ones. In this paper, a Maximum a Posteriori (MAP) estimation approach has been used to estimate the parameters of the untied triphone model set used in data-driven clust...

Bayesian Approaches to Acoustic Modeling: A Review

This paper focuses on applications of Bayesian approaches to acoustic modeling for speech recognition and related speech processing applications. Bayesian approaches have been widely studied in the fields of statistics and machine learning, and one of their advantages is that their generalization capability is better than that of conventional approaches (e.g., maximum likelihood). On the other h...

Testing the Hypothesis of Multivariate Normality in Bayesian Approaches to Speaker Adaptation

Bayesian approaches to speaker adaptation are popular in Automatic Speech Recognition (ASR) systems. In most kinds of Bayesian adaptation, there are parameters whose prior distributions are assumed to be multivariate normal. This paper presents a methodology that can test the hypothesis of multivariate normality. When applied to Maximum A Posteriori (MAP) adaptation, we found that the real pri...

A Bayesian Network View on Acoustic Model-Based Techniques for Robust Speech Recognition

This article provides a unifying Bayesian network view on various approaches for acoustic model adaptation, missing feature, and uncertainty decoding that are well known in the literature on robust automatic speech recognition. The representatives of these classes can often be deduced from a Bayesian network that extends the conventional hidden Markov models used in speech recognition. These ex...

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

To facilitate the entry of data into the computer and its digitization, automatic recognition of printed texts and manuscripts is a considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

Publication date: 2011